DYNAMICAL PROPERTIES OF THE PASCAL ADIC TRANSFORMATION 
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Abstract. We study the dynamics of a transformation that acts on infinite paths in the graph associated 
with Pascal's triangle. For each ergodic invariant measure the asymptotic law of the return time to cylinders 
is given by a step function. We construct a representation of the system by a subshift on a two-symbol 
alphabet and then prove that the complexity function of this subshift is asymptotic to a cubic, the frequencies 
of occurrence of blocks behave in a regular manner, and the subshift is topologically weak mixing. 



1. Introduction 

Adic transformations were introduced by A. Vcrshik as combinatorial models of the cutting and stacking 
constructions familiar in ergodic theory [38, 39, 44]. They move sequences in a transverse manner to the 
usual shift transformation, much as the horocycle flow is transverse to the geodesic flow (cf. [34, 17]). An 
adic transformation acts on the space of infinite paths on an infinite graded graph, or Brattcli diagram, and 
the dynamics of these transformations can provide information about the associated C* algebras or families 
of group representations (see for example [37, 41, 42, 43]). Stationary adic transformations (in which after 
the first, or root, level all levels of the graph have the same number of vertices and the same pattern of 
connections to the next level) correspond to odometers and substitution subshifts [13, 21, 44, 16, 35, 36, 7]. 
Every minimal homeomorphism of the Cantor set is topologically conjugate to a particular type of adic 
transformation [15], and every ergodic measure-preserving transformation on a Lebesgue space is isomorphic 
to an adic transformation with a unique nonatomic invariant measure [39, 40]. The families of invariant 
sets for adic transformations correspond to tail fields in probability theory and ergodic theory, so ergodicity 
of invariant and quasi-invariant measures for adic systems is equivalent to 0,1 laws, which guarantee the 
triviality of these tail fields — see [28, 32, 33, 27]. An especially regular and simple- looking nonstationary 
adic is the one based on the Pascal triangle regarded as a graded graph. Its cr-algebra of invariant sets 
corresponds to the exchangeable or symmetric cr-algebra in {0, 1} N , the sets fixed by any permutation of 
finitely many coordinates (whereas the usual tail cr-algebra consists of the sets invariant under any change 
of finitely many coordinates). 

In this paper we establish several dynamical properties of the Pascal adic transformation. It is known 
that the set of nonatomic ergodic invariant measures for this system is a one-parameter family corresponding 
to the Bernoulli measures on {0, 1} N , as explained below. For each ergodic invariant measure we identify 
the asymptotic law of the return time to cylinder sets determined by finite initial path segments (Theorem 
3.2). The original Pascal transformation is not defined everywhere, which means that we are dealing with 
a noncompact topological dynamical system. Attempts at compactification or at forming quotients lead 
to discontinuities. To overcome these difficulties, we use a countable family of substitutions to produce a 
subshift on an alphabet of two symbols, {a, &}, which represents the Pascal adic except for countably many 
points (Theorem 4.1). This subshift consists of all subwords of all "basic words" formed by concatenating 
words, rather than adding integers, in Pascal's triangle. The basic word found at place k in row n has 
length equal to the binomial coefficient C(n, k) found at the same place in the actual Pascal triangle, and 
the structure of the word conveys some information about the history of its formation and therefore also 
some extra information about the binomial coefficient. Not only does this subshift have zero entropy (it was 
known before that all the invariant measures for the Pascal adic transformation have entropy zero), but we 
can determine its complexity function: the number of n-blocks is asymptotic to n 3 /6 (Theorem 4.6). While 
the subshift supports uncountably many ergodic invariant measures, it has a property that we call directional 
unique ergodicity: once a ray in Pascal's triangle beginning at the root is fixed, when we consider occurrences 
of a given block B only in basic blocks near that ray, the limiting frequency of occurrences exists and equals 
the measure of the cylinder set [B] according to the ergodic invariant measure parametrized by the angle of 
the ray (Theorem 4.7). Finally, we use a characterization of weak mixing by Keynes and Robertson [19] and 
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Wcyl's theorem on uniform distribution to show that the subshift is topologically weakly mixing (Theorem 
4.19). 

2. The Pascal adic transformation 

We define the Pascal adic transformation first in terms of its graph, then we give the cutting and stacking 
model to which it is isomorphic. 

2.1. The graph construction. The Pascal graph is an infinite planar graph divided into levels n = 0, 1, . . . , 
with a root vertex at level labeled (0, 0), and n + 1 vertices at each level n labeled (n, k) for k = 0, . . . , n. 
From each vertex (n,k) leave two edges; one goes to (n + l,k + 1) and is labeled by 1, and the other 
goes to (n + 1, k) and is labeled by — see Figure 1. The space X considered is the set of infinite paths 



(0,0) 



Level 



Level 1 



Level 2 



Level 3 



Level 4 




Figure 1. The Pascal Graph. The number of finite paths from the root to a vertex (n, k) 
is given by the binomial coefficient C(n, k) = n\/[k\(n — k)\]. 



going from the root down the graph, i.e. the set of all (n, k n ) n >i, where < k n < n and fc n +i = k n or 
k n + 1. The labeling of the edges produces a natural one-to-one correspondence between X and the set 
{0, 1} N of infinite sequences of 0's and l's. The space X is compact for the product topology, and we denote 
by B the Borel er-algebra. Let d be the usual metric on the space {0, 1} N (letting d(x,y) — (n + 1) _1 
whenever x and y disagree for the first time below the n'th level). A cylinder set in X is a set of the type 
{x £ X : Xi ± = ai, Xi 2 = a2, . . . , Xi B = a s }, and the family of cylinder sets generate B. For convenience 
we will often denote by [a\a2 ■ ■ ■ a s ] the cylinder set {x £ X : x\ = ai,X2 = 02, . . . ,x s = a s }. We will 
refer to a point x £ X by (n, fc n (a;)) n >i or by ■ ■ ■ , where xi, X2, . . . are the successive labels of the 

edges of x and k n (x) = J27=i x i- We put a partial order on X, writing x < y for x, y € X, whenever x 
and y coincide below a certain level n and x„ < y n . In other words, x and y are comparable with respect 
to this partial order if for some n G N n } - — yj for all j > n and there is a permutation n g S n such that 

7r(xi, . . . , x n ) = (v-i , y n ). Let X^ and X 

haw 1 



Xn) = (yi, ■ ■ ■ , Dn)- Let X m i n and X max be respectively the set of minimal and maximal paths. We 



Xmax — {^niax — 01 : 1 > 1} U {x max — , '. 



1°°} 



X u 



1*0° 



i > 1}. 



Definition 2.1. The Pascal adic transformation is defined from X\X max onto X\X m i n by Tx = smallest 
y greater than x. For every x £ X \ X max there are positive integers n, to, and x' £ {0, 1} N such that 
x = 0™l m 10x'. Hence an equivalent definition (illustrated in Figure 2) of T is 

T(0 n l m 10x') = l m 0"01aj / . 

Note that if x = l k n ~ k .... i.e. x coincides with the minimal path through the vertex (n, k), then T l x 
for i = 1, . . . , C(n, k) — 1 goes through all C(n, k) finite paths from (0, 0) to (n, k). 
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There is a natural way to extend T bijectively on the whole space X by sending maximal paths to minimal 
ones: 



Tx % 
Tx° 
Tr°° 



mm 


max 7 
oo 



for all i > 1, 



Unfortunately, this extension is not continuous at the points £ max , x m£ 




Figure 2. T permutes finitely many coordinates and leaves the others unchanged; it 
"carries" a path transversely to the shift transformation. Illustrated in the figure are : 
x = 0011100100... and Tx = 1100010100.... 

A simple observation is that if a path x has a "kink" at level n, i.e. if s n+ is n+ 2 = 10 — see Figure 3 — 
then x comes back close to itself after C(n, k n (x)) steps: 

Lemma 2.2 (The "Kink" Lemma). If x G X be a path such that x n +iXn+2 — 10, then T c ^ n ' kn ^ x ^x and x 
coincide along the first n coordinates. 

Proof. Consider the following two paths (depicted in Figure 3) lying in the orbit of x: 

x+ = o n ~ k ^ x h k ^ x hox n+3 x n+4 . . . 

x- = l k ^0 n ~ k ^01x n+3 x n+4 .... 

The adic takes the path x to x + after I iterations, for a certain integer I. Then the adic applied once to x + 
brings us to x~ . And finally, it takes m iterations of the adic, for a certain integer m, to match up the first 
n coordinates of x. The total number of iterations we have made is simply the total number of finite paths 
from (0, 0) to (n, k n (x)), so that I + 1 + m = C(n, k n (x)), establishing the lemma. 

□ 

Denote the orbit of a point x £ X by 0(x) = {T n x : n G Z}. (X,T) is not quite a minimal topological 
dynamical system (in the sense of a homeomorphism between compact spaces) , but the Kink Lemma implies 
that if 10 appears infinitely many times in x, then x has a dense orbit: 

Proposition 2.3. T : X \ X max — > X \ X m i„ is a homeomorphism, and for every x € X , exactly one of the 
following holds: 

(i) 0{x) = {x° max } or 0(x) = {x™ ax } (x is a fixed point) 

(ii) there exists n > 1 such that x™ nax G 0(x) (the orbit of x is infinite but not dense) 

(iii) 0(x) — X (x has a dense orbit). 
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FIGURE 3. The "Kink" Lemma 

2.2. Ergodic measures. If p = (p, 1 — p) is the probability on {0, 1} which gives mass p to and mass 
1 — p to 1, then the product measure \i v = p® N on {0, 1} N is called a Bernoulli measure and is often denoted 
by B(p,l-p). 

As noted in [27, 28] the result that the invariant ergodic Borel probability measures for the Pascal adic 
are the Bernoulli measures B(p, 1 — p) is well known; it has been proved by using the Ergodic Theorem or 
the Martingale Convergence Theorem. A more geometric and calculation-free approach developed in [23] 
permits extension of these results to a wider class of systems, the generalized Pascal adics. The statement 
that any T-invariant ergodic measure is a Bernoulli measure can be attributed to dc Finetti in the context 
of exchangeable processes. The converse, stating that every Bernoulli measure is ergodic for the Pascal adic, 
follows from a result by Hajian, Ito and Kakutani on a system isomorphic to the Pascal adic defined by 
interval splitting [14]. The connection with adics was made by Vershik [39]. 

In fact the Pascal adic is totally ergodic (every power T" is ergodic). This is equivalent to saying that 
T does not have any eigenvalues (other than 1) which are roots of unity, which follows from the self-similar 
structure of Pascal's triangle modulo any prime (a consequence of a result of Lucas [22, 2]). 

2.3. The cutting and stacking equivalent. Start by dividing the unit interval into two equal pieces. At 
each step, the stacks are divided into two equal halves, and the right half of each stack is placed on the 
bottom of the left half of the following stack — see Figure 4. If we repeat indefinitely, the resulting map Tt, 
(which maps every open interval of each stack to the one above it) is defined everywhere except at the dyadic 
rationals (which correspond to the paths which are eventually diagonal in the graph construction). Denote 
by m Lebesguc measure, and let 2?([0, 1]) be the cr-algebra of Borel sets in [0, 1]. ([0, 1], 2?([0, 1]), Tb, m) is a 
measure-preserving system which we will refer to as the binomial transformation. 

Proposition 2.4. The systems ([0, 1], B([0, 1]), m) and (X, B,T, H1/2) are isomorphic. 

Proof. The isomorphism ip : [0, 1] \ {dyadic rationals} — > X \ {y : x 6 A m ; n U A max and y € O(x)} is 
defined by VKS^li x i^~ % ) = x i x 2 ■ ■ ■ ■ By induction one can check that at step n the bottom and top 
levels of the «'th stack in the cutting and stacking are mapped respectively to the cylinders [i™- 4 + 1 o i_1 ] 
and [0 I-1 l"~ i+1 ]. Note also that two points exactly above one another have dyadic expansions which 
eventually coincide. If x is not a dyadic rational then for some level n it belongs to the right half of 
the top level in some stack, i.e. i[>( x ) = i_1 l n_ ' +1 0a;„ + ia; Il +2 ■ ■ ■ for some i € {0, ... ,n — 1}. Therefore 
ip(Tt,(x)) = l n_I l lx n +ix n +2 ■ ■ • = T(ip(x)). Furthermore, ijj takes the Lebesguc measure m to the Bernoulli 
measure fi±/2, since the measures of the levels at step n are equal to 2~ n . □ 

Remark 1. The inverse of this isomorphism also carries any Bernoulli measure B(p, 1 — p) on {0, 1} N to the 
Cantor measure m p on [0, 1]. Another point of view is to cut and stack with proportions p and 1 — p\ then 
Lebesgue measure carries to the Bernoulli measure B(p, 1 — p). 
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Figure 4. The binomial transformation, or cutting and stacking construction of the Pascal adic 




t 1 1 , 1 ( , 1 i 1 Lj 1 

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 

FIGURE 5. The plot of the Pascal adic (or Binomial transformation). 

We used Matlab to produce the plot of the binomial transformation — see Figure 5. Note the symmetry 
with respect to the line y = 1 — x, which can be stated as 

Th(l - T h x) = 1 - x. 

In the adic point of view this is equivalent to 

TSTx = Sx, 

where S is the transformation of {0, 1} N which interchanges 0's and l's. We state this observation as follows: 

Proposition 2.5. The Pascal adic T is conjugate to its inverse via the map S which interchanges symbols: 
TS = ST-\ 
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Proof. Let x € X \ A" max . We can write x in the form x = O l V10x' , where i,j > and x' is an infinite 
string of O's and l's. Then 

TST(x) = TS(l j O l 01x') 
= T{On l 10S(x')) 
= VQ^OlSix') = S(x). 

□ 

3. Limit laws for return times into cylinders 

Considerable attention has been devoted recently to determining the asymptotic laws of return times or 
hitting times to "shrinking targets": sec, for example, [29, 5, 4, 25, 31, 1, 20, 8] and the references that they 
contain. We establish for the Pascal adic that the limit laws of return times into typical cylinders, when 
properly scaled, are piecewise constant. 

For simplicity we assume that /i is the Bernoulli measure B( 1/2, 1/2), but the steps below can be adapted 
to the general case. Fix a generic point u) G X. Denote by U n the cylinder generated by the coordinates 
u>i, u)2, ■ ■ ■ , w n (as above we use the notation U n = • ■ ■ u n ]). Let r n be the first return time (or entrance 

time) to [/„, i.e. 

r n (x) = inf{fc > 1 : T k x G U n }. 

We are interested in the asymptotics of the return times r„, the question being what is the limit of the 
following function when the right scaling c„ is chosen: 

G n (t) = j fi{x G U n : CnH{U n )T Un {x) > t}. 

Consider the cylinders C" m = [uj\ . . . w„0'l m 10]. Then U n = 1J Z m>0 C" m (disjoint union up to a set of 
/x-measure zero). For x G C™ m and k n = — see Figure 6 — the first return time to U n is given by 

the following formula: 

Proposition 3.1. r n (x) = C(n + I, k n ) + C(n + to, k n + m) — C(n, k n ) 

Proof. If x G C" m , it will return to U n for the first time when it enters the cylinder [lo\ . . . w„l m z 01]. After 
a certain number N\ of iterations x gets mapped to [0 n_fe " l fc "0'l m 10]. It takes C(n + l,k n ) — C(n,k n ) 
more iterations to bring x to the cylinder [o n_fc " +z l fc " +, ™10]. Applying the adic one more time takes us to 
[l fc " +m 0"- fc "+'01]. Similarly, after C(n + m, k n + m) - C(n, k n ) more iterations we are in [l fc «0 n " fc " l m 0'01] 
and after a certain number N2 of iterations x finally re-enters U n . To conclude note that Ni + N2 + 1 = 
C(n,k n ). □ 



(0,0) 




Figure 6. The first time an element of C™ m returns to U n is when it enters the cylinder [u\ . . . w„l m 0'01]. 
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Letting t" m = ^c„t„ for n > l,l,m> 0, we have 



G n (t) = 2" £ ^ X e C lra tl,m > *} = ^ £ A*(C£J 

Z,m>0 ;,m>0:t^ m >t 

= 2 » V - = - V — 



Z,m>0:t™ m >t i,m>0:t" m >t 



Using Stirling's Formula, we know that C(n, k n )/2 n m Ij^fh^n — k n ), so that when c„ grows faster than 
y/n, lim„_ >00 t™ m = and therefore lim„_ >0o G n (t) = l(-oo,o](£)i an d when c„ grows slower than y/n, then 
linin^oo i™ = 00, which implies that linin^oo G n (t) = l(_ 00i00 ). The interesting scaling is c„ = -y/n. Then 
using again Stirling's formula and the fact that k n /n — ► 1/2 fi-a.e. we get 



1 In 1 

*o,o = F V^(n, fc„) = _ Vn ^l^, 

n n / - — 



it^^ e n-L y/Mn-k n ) 

1 _ „n+l/2 



2"V2^ kn n+1/2 (n - k n ) n - k ~+y 
1 



, fen+l/2 , 

71 n \ n 



2"V2tt \l n-k n \k Ti 

n — ►oo y 7f 



i—k n 



Then 



2" 



l 0,0 



(n + m) . . . (n + 1) (n + Z) . . . (n + 1) 



(fc„ + m) . . . (fc + 1) (n + Z - kn) . . . (n + 1 - k n ) 



- 1 



(2' +2 m - 1). 

n — >oo V 7f 



We can now easily deduce that G n converges to a step function whose heights are computed below. Let 
l~2 



U,j = d - (2 4 + 2 J - 1) for i > and < j < i. 
Assume that tij < t < tij+i and j + 1 < i; then 



lim G n (t) = - lim £ 

l,m>0:tf m >t 

= - T '— 

4 L^i 2 l + m 

(,m>0:4 !iTO >t 



- y —+2 

\l=m>0:ti m >t ~" l>m>0:ti m >t 



Z^ 0'+' 



i-1 



4 £ 02/ + ^ £ 9i+™ + ^ £ £ Ol+l 



4 \ 2 2/ ^ 2 i+m ^ ^ 2 l 
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Similarly, if tij < t < ti+i ; o (the case i = j), then 



-. / OO oo I— 1 



\/=i+l Z=i+1 m=0 



1 



) 



_ 2~2(j+i) (2'+ 2 - l) 



To summarize: 



Theorem 3.2. GJt) = - 

A 

piecewise constant function 
fl 



1 



Vnf^(U n )T n (x) > t} converges pointwise as n — > oo to i/ie 



A»(J7„) 



7! 





where t^j = 



Remark 2. If p ^ 1/2 and /Lt is the Bernoulli measure /3(p, 1 — p), then G n (t) defined as above still converges 
to a piecewise constant function, but the formula for the function is rather messy. 



Stationary adics are topologically conjugate to substitution or odometer symbolic dynamical systems 
[13, 21, 7]. Here we use the idea described in [7, 16] to associate a substitution to an adic to show that the 
Pascal adic is isomorphic to a subshift whose language is determined by countably many substitutions. For 
basic definitions and general information about substitution systems see [30, 12]. We will use the alphabet 
{a, b} for our subshift to decrease confusion about the various representations of the Pascal adic. 

Consider the substitutions Q : {0, 1, ...,«} —» {a, b}* = all finite words on the alphabet {a, b} defined by 



Denote by £(Ci) the language associated to Ci> by which we mean the subset of {a, b}* consisting of all 
subwords of all the i > 1, < j ' < i. Let E be the subshift consisting of all sequences in {a, b} z all of 
whose subwords belong to £(() — UiC(Q). Figure 7 shows how the "basic" words in the language £(C) can 
be obtained by successive concatenations. It is not very hard to see that these basic words give the codings 
of the cylinders [l k n ~ k ] in the Pascal graph under the action of the Pascal adic transformation according 
to the first edge (left ~ label 1 ~ b, right ~ label ~ a) — or, in the cutting and stacking representation, the 
coding according to visits to [0, 1/2) ~ b or [1/2, 1] ~ a (see Figure 8). (Just note that the cylinder [l fc 0"~ fe ] 
corresponds to the bottom level of the fcth stack in the cutting and stacking at stage n, thus its coding is 
the word seen going up the stack (Figure 8), which is the word Cn(k) shown at vertex (n, k) in Figure 7). 



4. A COUNTABLE-SUBSTITUTION SUBSHIFT 



&(i) = b Ci(0) = a for alii >1 

Cs(i) = Ci-x(j)Ci-x(j - 1) for alH > 2 and < j < i. 



b a 



b ba a 



b b 2 a ba 2 a 
b b 3 a b 2 aba 2 ba 3 a 
b b 4 a b 3 ab 2 aba 2 b 2 aba 2 ba 3 ba 4 a 
b b 5 a b 4 ab 3 ab 2 aba 2 b 3 ab 2 aba 2 b 2 aba 2 ba 3 b 2 aba 2 ba 3 ba 4 ba 5 a 



Figure 7. "The Pascal triangle of words", showing the construction of the words by suc- 
cessive concatenations from level 1 to level 6. The word written at the "vertex" (n, fc) of 
this triangle represents the coding of the cylinder [l fc n ~ fc ]. 
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Theorem 4.1. Let {X,T) denote the Pascal adic. There are a countable subset X' C X and a one-to-one 
Borel measurable map <j> : X \ X' — ► E such that <f> o T = a o <f> on X \ X' . Consequently, for all a € (0, 1), 
for each nonatomic ergodic measure \i a on (X,T), if V a = fj, a o tfi^ 1 , then (X,T,/j, a ) and (E, a, v a ) are 
measurably isomorphic. 

Proof. Let P = {[0], [1]}, where we denote as previously [0] = {x € X : xq = 0} and [1] = {x € X : xq = 1}. 
Let X' be the countable subset of X consisting of all paths which are eventually diagonal, i.e. x € X' if and 
only if there exists n £ N such that either Xk = for all k > n, or Xk — 1 for all k > n. Let <\> be the coding 
of the Pascal adic (X, T) by the partition P, where [0] is coded with a and [1] with b. More precisely, for 
every x G X \ X' we define 

00*0 = (■ • ■ , W-2, W , CJi, £J 2 , • • .)) 

where 

_ Jo if (T J a;)o = 
^ _ \6 if (T*aOo = 1. 

The map <fi intertwines T and a (this is clear) and establishes a measure-theoretic isomorphism between 

b a 



a 

b b 



a a 
b a 
b b b a 



Figure 8. Coding by a's and 6's in the cutting and stacking. 

(X,T,[i a ) and (E,cr, v a ). To check this, it is enough to show that <j> is one-to-one, or, equivalently, that 
P is a generating partition for T . Let x, y G X be two different paths which are not eventually diagonal. 
Since x =/= y there is a smallest integer n such that x n ^ y n , and we can assume that x n = 1 and y n = 0. 
In addition, since x X max , x n +j — for some smallest j > 1. Consequently, the P-names of a; and y 
coincide until a; and y get mapped after iterations to the cylinder [0 z l m ] for some N,l,m > 1 (where 
to = fc„_i(x) = fc„„i(y), Z = n - fc„_i(a:), and < N < C{n,k n -i{x))). Then, since T N x = Q l l m+j+1 Q . . . 
and T w y = ( l m . . . , it follows that the P-name of T N x is b m+j+1 a . . . whereas the P-name of T N y is 
b m a . . . , showing that (f>{x) ^ 4>{y) — see Figure 9. 

□ 

4.1. Complexity. It is easy to see that the subshift (E, a) has topological entropy 0. Finer measures of the 
size or richness of symbolic dynamical systems can be drawn from asymptotics of the complexity function, 
which for each n gives the number p n of n- blocks found in sequences in the system. See [9, 11, 3, 12] for 
some examples of results on the complexity functions of various systems. Usually one finds upper or lower 
estimates on the growth rate of a complexity function. Here we show that for the subshift (E,<r) that we 
have associated to the Pascal adic, p n is asymptotic to n 3 /6. 

Let B n j — (j(n) denote the block at the vertex (n,j) in Figure 7 (for example P3.2 = b 2 a). We continue 
to refer to the B n ^s as basic blocks. These blocks satisfy the recurrence formula 

(4.1) B nJ = l!„ : .,/>'„ 1,, :• for < j < n. 

Every basic block at level n can be written as the product (we use interchangeably "product" and "con- 
catenation") of two basic blocks from level n — 1, and every basic block at level n — 1 can be written as 
the product of two basic blocks from level n — 2, and so forth... Thus we have a hierarchical decomposition 
of the basic blocks. We can view any block B n j as the product of basic blocks from level I for any I < n, 
depending on how far back in the hierarchy we want to look. 
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x 



Figure 9. After N steps x and y are mapped to the "maximal" cylinder [0'l m ]. At step 
N + 1, x is mapped to l m +-? -1 0' whereas y is mapped to l" l 0'. This shows that x will then 
stay in the cylinder [1] coded by b for m + j — 1 steps until it is mapped to the cylinder [0] 
coded by a, whereas it will only take m steps for y to get mapped into [0]. 

Definition 4.2. We say B £ £(C) is a new block at level I if B appears as a subblock of one of the basic 
blocks Bi k at level I, but does not appear as a subblock at level I — 1 (and a fortiori at any lower levels). 

We would like to count the number of different new n-blocks at level I for I = 1,2, ... . Although the 
recursive construction of the blocks is simple, it is not clear how to count precisely the different n-blocks 
since a given n-block can appear many times at the same level. The concatenation of two consecutive basic 
blocks at level I will result in the formation of new blocks at level I + 1, but how can we tell whether a block 
which overlaps two such basic blocks didn't appear higher in the Pascal triangle of words (see Figure 7)1 
After which level will we have seen all the different n-blocks? The following lemma provides an answer to 
these questions. 

Lemma 4.3. Let B be an n-subblock at level I + 1 of Bi + \j, for 2 < j <l — l. Assume that I, j are such that 
|-B;_ij_i| > max{n — (/ — j), n — (j — 1)} (which guarantees that B does not overlap Bj— i,j or S;_i jJ -_2^. 
Then B is a new n-block at level I + 1 if and only if B contains a J 6^ 1 . 

Example 4.4. Observe in Figure 7 that the only new 5-blocks which appear at level 6 as subblocks of 
#6,4, ^6,3, #6,2 are 

bab 3 , ab 3 a, ba 2 b 2 ,a 2 b 2 a, ba a b, a 3 ba. 
They all contain the block a 5- ^b^ -1 for some j. 

Proof. Suppose that B is a new block at level / + 1 not containing the subblock a For example, in 

the case Bi j = J3 6 4 and B% j_\ = B 6 3 , we have the following picture: 

Bl — i t j—i Bi—i,j — i 

A A 

Bi tj = b i a\ b 3 ab 2 aba 2 b 3 ab 2 aba 2 \b 2 aba 2 ba 3 = 
Bij = 6 4 a 1 6 3 ab 2 abaa bbb aab 2 aba 2 \b 2 aba 2 ba 3 = -B(j_x. 
~b~ ^ b ' 
(Note: We use "|" to symbolize where the concatenation at the previous level took place.) 

Observe that a l ~i is a right factor of B[ j, and that W~ l is a left factor of B[ Consequently, there 
exists k such that either B = u>b k and 1 < k < j — 2, or B = a k oj and 1 < k < I — j, where u) is 
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respectively a right or left factor of Bi-ij-x- The latter follows from the hypothesis that > 
max{n — (Z — j),n — (j — 1)} > \B\ — k. Suppose that B = ub k (the case B = a k ui is similar). Since 
_B;.j_i = S;_ij_iB;_i ) j_2 = Bj-ij-iZ^ -2 . . . , it follows that B is a subblock of This contradicts 

the fact that B is a new block at level Z + 1. 

Conversely, observe that for 1 < j < I — 1, Bij = Va. . . 6a'~ J . Thus the first time the block cr J Z> 7 _1 
will be seen is at level I + 1, as a subblock of BijBij-i. □ 

As a corollary we get: 

Lemma 4.5. ^4ZZ n-blocks are seen at level n + 2 as subblocks. 

Proof. Assume there exists a new n-block B at level n + 3. First note that i? cannot be a subblock of any 
of the following "edge" blocks: 

B n +3,n+3 = b 7 -B n +3,ri+2 = b n+2 a, B n+ z.i = ba n+2 , B n+ ^^ = a. 

Otherwise B = b n ,b n ~ 1 a,a n , or 6a™ _1 , and those blocks are already seen at level n + 1. Therefore B is 
coming from the concatenation of B n+ 2,j and B n+ 2,j— lj for some j with 2 < j < n + 1. By Lemma 4.3 
(applied in the case Z = n + 2), B must contain the subblock a"" 1 " 2 "^' -1 , which is impossible. □ 

Theorem 4.6. lim ^ = — . 

Proof. Fix n. Consider the Pascal triangle of words from level 1 to level n + 1. We will count the number 
of n-blocks by estimating how many new n-blocks are created from one level to the next by concatenation 
of two adjacent basic blocks. We divide the triangle into two disjoint regions: let region I be the subset of 
the triangle formed by all blocks up to level v2n, as well as all blocks below the level s/2n located at the 
vertices (Z, k), for \phi < I < n + 1 and k = 0,1,1 — 1,1. Let region II be the complement of region I in the 
triangle considered — see Figure 10. Let pi be the sum for I = 1 to n + 1 of the number of new n-blocks at 
level Z appearing as subblocks of the product of two adjacent basic blocks, one of them at least belonging 
to region I. Similarly, let pu denote the sum for \/2n < I < n + 1 of the number of new n-blocks at level I 
appearing as subblocks of the product of two consecutive basic blocks, both of them belonging to region II. 
By Lemma 4.5, since all n-blocks are seen at level n + 2, we have the following inequality: 

PU < Pn <PI+ PU- 

In region II, note that the hypothesis of Lemma 4.3 is satisfied since if Bij and -B;,j-i are two consecutive 
blocks in that region |B/_x ti j_i| > (\/2n + l)V2n/2. By Lemma 4.3, a subblock B of BijBi j^i is a new 
n-block at level Z + 1 if and only if oJ^ity^ 1 is a subblock of B. Therefore, the number of new n-subblocks 
of Bi jBi_j-i is equal to n — I. Thus 

Pu= J2 («-0(J-3) 

V2n<l<n+1 
n+1 n+1 

1=1 1=1 

(n+l)(n + 2) (n+ l)(n + 2)(2n + 3) , 
= n + o(n ) 

n 3 2n 3 . ,, n 3 . ,. 
= Y- — +o(n*) = - + o(n*). 

For region I, a coarse approximation gives 

Pi< J2 (n-l)(Z-l) + 4 ^ (l-l) = o( 

l<l<V2n \/2n<t<n+l 



n 3 ) 



It follows that 



— + o(n 3 )< Pn < — + (n 3 ). 
6 6 



□ 



An immediate corollary is that the countable-substitution subshift (£, a) has topological entropy 

Zi top (S,cr) = hm = 0. 
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Figure 10. In region II, two consecutive basic blocks Bij and Bi j_i at level I will generate 
n — I new n-blocks at level I + 1, according to Lemma 4.3. 

The variational principle implies that the measure-theoretic entropy h(a) with respect to every measure v a 
is zero. Since entropy is an isomorphism invariant, this proves that the Pascal adic has zero entropy for 
every Bernoulli measure /j, a . 

4.2. Directional unique ergodicity. For any two blocks B and C in the language £(C), let £req(B,C) 
denote the frequency of occurrences of B in C . In particular, if w G S, and if (*>_ N denotes the block 
uj^n^~n+i . . . Wo . . . wjv-iWjv, then, provided that 2N + 1 is greater than \B\ (the length of B), we have 

N-\B\ 

freq(S,w5 Ar ) = ^— T £ l [B] o^(w). 

»=-JV 

The ergodicity of each v a = 4>()J. a ) (carried by the isomorphism) implies therefore that for every block 
B E £(C) and v a -&.c. uj e S 

(4.2) lim hcq{B,uj N N ) = v a {B). 

N^OQ 

Let i be a path in the Pascal graph going through the vertices (n, fc n (x)), and let uj = <p(x). If x is not 
eventually diagonal, then there are sequences i n (x) f oo and j n {x) /* oo such that B n kn ^ = w^ ) x y 
where B n ^ n is the basic block at vertex (n, k n ) in the Pascal triangle of words. Therefore, (4.2) implies that 
for ^ Q -a.e. x 

lim heci(B,B n kn{x) ) =v a {B). 

n — >oo 

In other words, when (n, k n ) for n = 1, 2, . . . are the vertices of a generic path, which goes down the Pascal 
graph at an "angle" a, we have freq(-B, B n ^ n ) v a (B). We strengthen this statement as follows: 

Theorem 4.7. For any block B £ £(C)j ^2/ ot G (0? l)j any sequence k n — > oo suc/i i/iat fc„/n — > a we 
have 

lim freq(-B, _B n ,fc J = v a (B). 

n— >oo 

To prove Theorem 4.7 we determine explicitly where a given block B is made and how many times it 
appears in each basic block. First we introduce some notation and recall the key structure of the basic 
blocks. For a fixed block B £ £(C)j denote by a(B,n,k) the number of occurrences of B in B n ^. If no 
confusion is possible we will simply denote it by <x(n, fc), so that 

(4-3) ^(B,B n>k J = p^ = p^. 

\B n ,kJ C\n,k n ) 
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Figure 11. For any block B n ^ in the filled area the frequency of appearances of B in B n ^ 
is e-closc to v a {B) (for S small and N large). 



By induction it is easy to show that 

B rhk = b k a . . . ba n - k - 1 b k - 1 a . . . ba n ~ k . 

This decomposition characterizes the basic blocks B n ^', if one sees the telltale block ba rl ~ k ~ 1 b k ~ 1 a, it is 
always within B n ^, at the "join", as seen above. The previous structure is easily seen by writing only the 
beginning and ending of the basic blocks in the Pascal triangle of words; for example, at level n we have: 

b £2^3 ■■ b k a...ba n - k ... ba™^ a 
Figure 12 shows how the structure is carried from one level to the next: 



b*- L a...ba T 
b k a...ba n - k - 1 b k - x a . . .ba n ~ k 



b k+1 a...ba n - k - 1 b k a. 



ba n-k 6fc-i a ...b ™-fc+i 



b k+2 a . . . ba n - k ~ 1 b k+1 a...ba n - k b k a . . .ba n - k+1 6 fc - 1 a . . .ba n - k+2 



Figure 12 



Lemma 4.8. Let B £ £(C)- Then there is a unique vertex (no, kg) such that B is a subblock of B no ^ and 
B does not appear in any other basic block in the "rectangle" above (no, fco) formed by all vertices (n, k) with 
k < fco and n — k < no — ko . 

Proof. Assume that B is a subblock of both B na ^ and B n > )( y , and that B does not appear in any other 
basic block in the "rectangles" above (no, fco) an d (n ,fco) (in particular we can assume that n > n and 
k'o > ko) — see Figure 4.2. Recall that 

B no ,k = b k °a . . . ba n °- k °- 1 \b k "- 1 . . . ba n °- k ° 

B n ' ,k' = b k '°a . . . 6a n «" fc »~ 1 |6 J °- 1 . . . ba<~ k '°, 
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so that B must contain a central subblock of both 



1 fcn 1 



fe'-l 




no — feo — 1 

(By central we mean containing a\b, where "|" symbolizes the splitting into blocks of the above level). The 
only way it can happen is if B = a'b 3 with i < n — fc — 1 and j < ko — 1. Therefore B had to appear at the 
vertex (n — k' + ko, feo) whose basic block is . . . ba n °~ k °~ 1 b k °~ 1 a . . . , contradicting our initial assumption. 

□ 



Region where we assume B does not appear 




Remark 3. From the uniqueness in Lemma 4.8, it follows that if the block B appears for the first time at 
the vertex (no, feo), then it will only appear as a subblock in the triangle below the vertex (no, feo), as shown 
in Figure 3. 



Region where the block B appears 




All vertices (n + i, k + j] 
for i > and < j < i. 



Lemma 4.9. If a block B appears in B„ ^ for the first time, then it can appear at most twice (i.e a(no, feo) 
1 or 2). 

Proof. If B appears for the first time in B n0t k , then it is a central subblock of 



b ko 



n — fc — 1 ILfco — 1 



|6* 



. a 



n a — k 
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There are two possibilities: either B contains the telltale block ba n °~ k °~ 1 \b k °~ 1 a, or it does not. In the first 
case, B cannot appear twice, since the telltale block appears only once in B no ^ - In the second case, the 
only way that B could appear twice is if it would start with a l 6 feo_1 for some i < uq — fco — 1, and end with 
a n -k -xp for gome j < fc - 1: 

D 



Jjka a n °~ k °~ 1 | b k °~ 1 a n o-ka 

foko a no-fco-l| 6 fco-l a n °~ k ° 

S v ' 

B 

Then clearly B cannot appear a third time. □ 

Given a block B £ £((,), assume that B appears for the first time at vertex (no, fco) in the Pascal triangle 
of words (this first appearance is unique by Lemma 4.8). Define the triangle of appearances of B to be 

a(n , fco) 
a(no + l, fco + 1) a(n + l,k ) 
a(n + 2, fc + 2) a(n + 2, fc + 1) a(n + 2, fc ) 

By Lemma 4.8, all other frequencies arc zero, namely a(n, fc) = for all vertices (n, fc) not equal to (no + 
i, fco + j) where i > and < j < i — see Figure 3. 

Example 4.10. The triangle of appearances of B = a 2 b 2 is 

I 

2 2 
3 5 2 
4 9 9 4 



where the first element in the triangle is a(6, 3). 

It turns out that the triangle of appearances of a block B is either of the type of example 4.10 or the 
Pascal triangle with certain "initial diagonals" removed. 

Lemma 4.11. Let B £ £(f) and assume B appears for the first time in B no ^ Q . Let i$ = no — fco — 1 and 
j = ko — I. Then there are exactly five possible cases for the triangle of appearances of B: 
Case 1 : B = a 10 !^ . The number of occurrences of B satisfies the recurrence relation 

a(n, fc) = a(n — 1, fc) + a(n — 1, fc — 1) + 1, 

and the triangle of occurrences of B is 

1 

2 2 
3 5 2 
4 9 9 4 

Lt follows that a(n, fc) = C(n — n + 2, fc — fc + 1) — 1. 

Case 2 : B = . . .ba l °b J0 a. . . (B contains the telltale block). The number of occurrences of B satisfies the 
recurrence relation 

a(n, fc) — a(n — 1, fc) + a(n — 1, fc — 1), 
and the triangle of occurrences of B is just the Pascal triangle. 

1 

1 1 
1 2 1 
13 3 1 
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It follows that a(n, k) = C(n — no, k — kg). 

Case 3 : B = a l B no —2,k — ib J for some i < iq and j < jo- The number of occurrences of B satisfies the 
recurrence relation 

a(n, k) = a(n — 1, k) + a(n — 1, k — 1), 
and the triangle of occurrences of B is 

2 

3 3 
4 6 4 
5 10 10 5 

It follows that a(n, k) = C(n — no + 2, k — ko + 1). 

Case 4: B = . . .ba % °\P for some j < jo, and B is not as in Case 3. The number of occurrences of B satisfies 
the recurrence relation 

a(n, k) = a{n — 1, k) + a(n — 1, k — 1), 
and the triangle of occurrences of B is the following 

1 

2 1 
3 3 1 
4 6 4 1 

It follows that a(n, k) = C(n — no + 1, k — ko). 

Case 5 : B = a l b 3o a. . . for some i < io, and B is not as in Case 3. The number of occurrences of B satisfies 
the recurrence relation 

a(n, k) = a(n — 1, k) + a(n — 1, k — 1), 
and the triangle of occurrences of B is the following 

1 

1 2 

1 3 3 
14 6 4 

It follows that a(n, k) = C(n — no + 1, k — ko + 1). 

Proof. If B appears for the first time in -B„ .fc , then clearly B is one of the following: 

(i) B = a io l>> 

(ii) B = . . .ba io ti°a. . . 

(iii) B = . . .ba z °b^ for some j < jo 

(iv) B = a l 6 J0 a. . . for some i < io- 

In Case (i), which corresponds to Case 1, the block B appears only once in B n0t k , and is made infinitely 
many times further down. More precisely, every concatenation of two adjacent basic blocks below no creates 
a unique block B. It follows that a(n, k) = a(n — 1, k) + a(n — 1, k — 1) + 1, and the triangle of occurrences 
of B is 

1 

2 2 

3 5 2 
4 9 9 4 

In the previous triangle, a(n + no, k + ko) represents the number of finite paths from the root (0, 0) to the 
vertex (n, k) in a modified Pascal graph with extra "wormholes" — paths connecting directly (0, 0) to each 
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(n, k) — see Figure 4.2. It is not hard to see from this graph that a(n + no, k + ho) + 1 is equal to the sum 
of the binomial coefficients inside the rectangle determined by (0,0) and (n,k), i.e. 

n—k k-\-i 

a(n + n ,k + k ) + l = ^^CjjJ-i). 

i=0 j=i 

Using well-known properties of the binomial coefficients we get 

n—k k+i 

a(n + n ,k + k ) + 1 = ^ ^C(j, j - i) 

i—0 j—i 
n—k 



= ^C(fc + i + l ) fc+l) 



i=0 



= C(n + 2,k + l), 
showing as announced that a(n, k) = C(n — no + 2, k — ko +!) — !. 



(0,0) 




In Case (ii), a similar argument as in the proof of Lemma 4.9 shows that B can be made only once. 
Therefore a(n, k) = a(n — l,k) + a(n — 1, k — 1), and the triangle of appearances of B is just the Pascal 
triangle. Since the first clement in the triangle is a(rio,fco), it follows that a(n,k) = C(n — uq, k — ko), 
establishing Case 2. 

A special case of (iii) and (iv) is Case 3, i.e when B = a l B no _2,k — for some i < io and j < jo. Here, 
B is made twice at (no, ko), and infinitely many times along the edges of the triangle below (no, ko). First, 
observe that 

£>n ,k — ■■■ a -Dri -2,fc -l|-fc)no-2,fco-l O, 

Therefore B appears twice in J5 ra0i fc : 

B 



b k _ .. "o-fco-2 Bfio _ 2)fco _ l | 6 feo-l a no-k 

b k ° a Bo - fco - 1 |B no _2 1 fc -i& fco - 2 a n °- k ". 



B 

Second, note that B can be made exactly once at each vertex along the edges of the triangle below (no, fco): 

B 



^no+j^o — . . . u |jJ no _ 2, fco — l u 

and 

B-no+jM+j = • • • k ° 2 ^no-2,fc -i|^ 
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for all j > 1. Thus a(n, k) = a(n — 1, k) + a(n — 1, k — 1), and the triangle of occurrences of B is 

2 

3 3 
4 6 4 
5 10 10 5 



which is a subtriangle of the Pascal triangle, hence the relation a(n, k) = C(n — no + 1, k — ko) follows. 

To conclude, assume that B is as in (iii) but not of the type B = a i S no _ 2 ,fe D -i& J '. Then B = ... ba n °- k °- 1 V 
appears only once in B no ^ , because in order to appear twice it would have to contain the entire block 
B no —2,k — l- For the same reason it cannot be made again along the vertices (no +j, ko) for j > 1. Any two 
adjacent basic blocks inside the triangle below (no, fco) will n ot produce any more B's either, because they 
are all of the type . . . ba l \Va . . . where i > Iq. On the other hand, B is created once more at each vertex 
(n +j,k Q + j) for j>l: 

JD n +],k a +] ~ uu \ u 



Therefore the triangle of appearances is 



1 

2 1 
3 3 1 
4 6 4 1 



As previously this is a subtriangle of the Pascal triangle, thus a(n, k) = C(n — no + 1, k — ko), and Case 4 
is established. Case 5 is similar. □ 



Lemma 4.12. Let r > and < s < r. If k n /n — > a as n — ► oo, then 

C(n — r, k n — s) 



lim 

n — >oo 



C(n, k n ) 



a s (l-a) 7 



Remark 4. This was the key calculation used by Hajian, Ito, and Kakutani to prove the ergodicity of /x c 
for the Pascal adic — see [14], and also [28, Theorem 2.7]. 

Proof of Theorem 4-7. By (4.3) we need to show that 

n^oo C (n, k n ) 



(4.4) 






when k n /n — 


a. i 


Case 1 


C(n 


- n 


Case 2 


C(n 


- no 


Case 3 


C{n 


- n Q 


Case 4 


C(n 


- n Q 


Case 5 


C(n 


- n Q 



In each of these cases Lemma 4.12 shows that the limit in (4.4) equals respectively: 



Case 1 


a ko 


-\1 


-a) n °- 


fco 


Case 2 


a ko 


(1- 


Q,)«0-fe0 




Case 3 


a k ° 




- a) n "- 


feo 


Case 4 


a ko 


(1- 


a) n °~ k ° 


-1 


Case 5 


a ko 




- a) n °- 


feo 



On the other hand, v a (B) = ^ v a (B n ^), where the sum is taken over all (n, k) such that B is created 
in the basic block and if B is created twice in B n ^k, then v a (B n ^k) appears twice in the sum. In 
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particular, if B = B n ^ we have v a {B) = ar(l — a) n . So for example, if B is as in Case 3, we have: 



v a {B) — 2v a (B na , ka ) + ^2 Va(B no+ iM +i) + Va(B no +i,k ) 

i=l i=l 

oo oo 

= ^ ^(Sno+uto+i) + 2J ^(-Sno+J.fco) 
i=0 
oo 

= ^ a fco+< (1 - a)"°- fco + a feo (1 - a ) n °- ko+i 



i=0 

Q fco (l -a) Tl °- fco 
a fc °(l - a)"°- fco 



OO 

E' 

i=0 



a fe °(l - a) 



ra — feo 



oo 

,i=0 
1 

1 - a 
a + 1 - 



CSO 



+ > (l-a)« 



t=0 

1 

Q 



a) 



a(l 

= a fto_1 (l - a)™ 0- ' 50-1 . 
Similar computations show that the identity (4.4) also holds in the other cases. 



□ 



Remark 5. We could have used the Ergodic Theorem to show that the limit in (4.4), which is known to 
exist because of Lemma 4.11, must be equal to v a (B): as noted previously (p. 12), for fi a -a.e path x we 
have k n (x) jn — > a and 

a(n, k n (x)) 



lim 

n — >oo 



MB)- 



C(n, fc„) 

Remark 6. We thank Anthony Quas for an insight which leads to a proof simpler than the preceding one. It 
is possible to show that that after a while (that is, below a certain level) "new" creations of a block B due 
to concatenations constitute a negligible fraction of the total number of appearances of B. More specifically, 
fix an allowed block B e £(£) and a positive integer to large enough that B appears at level m of the 
Pascal triangle of words. Consider a level n with n > to. As mentioned above, any basic block B n ,k at this 
level factors uniquely into basic blocks of level to. Let us call "old" appearances of B in B n k the ones that 
are contained entirely in one of these basic blocks of level to, and "new" appearances the rest. The new 
appearances were created by concatenations after level to and contain division points of the factorization of 
B n ^ into basic blocks B m j. 

Let D(n, k) denote the length of B n ,fe when regarded as a word on the alphabet whose symbols are basic 
blocks of level to; thus D(n, k) is one more than the number of division points in the factorization under 
discussion. We have 

D(m,j) = 1 for all j = 0, 1, . . . , m, 
D(n, 0) = D(n, n) = 1 for all n > to, 
D(n, k) = D{n — 1, k — 1) + D(n — 1, k) for all n > m, k = 1, . . . , n — 1. 

Viewing the array whose (n, k) entry is D(n, k) as the sum of rn + 1 Pascal triangles originating at all the 
places in level to (since each of these places has an entry 1, which is the "seed" for an entire Pascal triangle 
below it), we have 



D(n,k) 



C(n — to, k — j). 



By Lemma 4.12, for n 3> to and k/n ps a, 



D(n,k) 
C(n,k) 



E 



C(n — m,k — j) 
C(n,fc) 



^ a J (l - a) m - 3 < (sup{a, 1 - a}) m+1 , 



so only a small fraction of the appearances of B in -B^fc are "new" . 

The frequency of appearance of B m j in B n ^ is C(n — to, /c — j)/C(n, k), which has a limiting value as 
n — > oo, fc/n — > a. Taking into account the number of times that our given block B appears in each basic 
word B m j at level m, since we can ignore the "new" appearances of B in B n ^ we see that B also has a 
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limiting frequency of appearance in B n _k as n — > oo, k/n — > a (and we can compute it, getting the same 
answer as before). 

4.3. Topological weak mixing. In this section we prove that the countable-substitution subshift version of 
the Pascal adic is topologically weakly mixing. Since the system is not minimal (for example, it contains the 
two fixed points of {a, 6} z ), it is not enough to prove that there are no nonconstant continuous eigenfunctions 
(cf [26]). Instead, we use a characterization of topological weak mixing provided by Keynes and Robertson 
[19] (see also [18]), along with Weyl's theorem on uniform distribution. 

In all the following (A, T) denotes a topological dynamical system, i.e. A is a compact metric space and 
T : X — ► X is a homeomorphism. Recall that (X, T) is topologically ergodic (i.e. topologically transitive) 
if there is a point x £ X with a dense orbit. Since the Pascal adic is ergodic for the Bernoulli measures 
fj, a , it follows that the (fully-supported) image measure </>(^t Q ) is ergodic for the substitution subshift, and 
therefore the system is topologically ergodic. Actually, if a; is a path in the Pascal graph which is not 
eventually diagonal, then (j){x) has a dense orbit in £, so in fact all but countably many orbits in £ are 
dense. We say that {X, T) is topologically weakly mixing if (X x X, T x T) is topologically ergodic. 

Definition 4.13. Let X be a topological space. Denote by C(X) the space of all continuous functions 
/ : X — > C, and by B(X) the set of bounded functions / : X — > C such that the set C(f) of points of 
continuity of / is residual. 

Definition 4.14. Let f,g£ B(X). We say that / and g are essentially equal, and write / = s g, if / = g on 

C(f)nc( g ). 

Remark 7. If /, g £ B(X), then / = g if and only if / = g on a dense set. 

Theorem 4.15 ([19]). Let (A, T) be a topological dynamical system. The following are equivalent: 

(i) (X, T) is topologically ergodic. 

(ii) For every f £ B(X), if f o T e = f , then f e = constant. 

(iii) For every f G B(X), if f oT = f (everywhere) , then f = constant. 

Remark 8. It is easy to show that if (X, T) is ergodic, and A £ C is an eigenvalue for some / £ B(X), then 
|A| = 1 and |/| = s constant. 

Definition 4.16. A Borcl probability measure /i on X is called closed ergodic for T if every closed invariant 
subset of X has /x- measure or 1. 

Here is a criterion for weak mixing that we will use to show that the substitution dynamical system (E, a) 
is topologically weakly mixing: 

Theorem 4.17 ([19]). Let (A, T) be a topological dynamical system and suppose there exists a T -invariant 
Borel probability measure fi supported on all of X which is closed ergodic. The following are equivalent: 

(i) (A, T) is topologically weakly mixing. 

(ii) For every f £ B(X), if there is X £ C such that / oT = A/ (everywhere), then f = constant. 

Every measure v a is ergodic (so in particular closed ergodic) and has support equal to S, so Theorem 
4.17 applies in our case. Given / £ fi(£) and A <G C such that / o a = A/ (everywhere), if we can show that 
A = 1 then, combining Theorem 4.15 and Theorem 4.17, one would show that (E, a) is topologically weakly 
mixing. 

We need the following lemma: 

Lemma 4.18. Let (X,T) be a topological dynamical system (with underlying metric d). Let f : X — > C be 
a function with C(f) ^ and such that f o T = Xf for some X £ C. Then every point x £ X with dense 
forward orbit and dense backward orbit is in C(f). 

Proof. Let x £ X be a point such that both semiorbits {T n x\n > 0} and {T n x|n < 0} are dense. Let 
z £ C(f). Since {T n x\n > 0} is dense, there exists m / oo such that T n *x ->■ z. Therefore \f o T n '(x)\ = 
\X\ ni \f(x)\ forces |A| < 1. Since {T n x\n < 0} is also dense, a similar argument shows that |A| > 1, so |A| = 1. 

Assume that x has both semiorbits dense and x ^ C(f). Then there exists a > such that for all S > 
there is y £ X such that 

(4.5) d(x,y) <<5and \f(x)-f(y)\ > a. 
Since z £ C(f), there exists -q > such that 

(4.6) if d(z,u)<r), then \f(z) - f(u)\ < a/4. 
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Since x has dense orbit we can find n £ Z such that d(T n x, z) < rf/2. By continuity of T n there exists 6 > 
such that 

if d(x,y) < S, then \T n x - T n y\ < n/2. 
For that previous S, by (4.5), there exists y such that 

d(x, y) <S and \f(x) - f(y)\ > a. 
Since A is an eigenvalue with modulus one we have 

|/ o T n x - / o T n y\ = \X n f(x) - X n f(y)\ = \f(x) - f(y)\ > a. 
On the other hand, since d(T n x, z) < r//2 and d(T n y, z) < rj, by (4.6) we get the following contradiction: 

|/ o T n x - f o T n y\ < \f o T n x - f(z)\ + \f(z) - f o T n y\ < a/2. 
Thus i e C(/). □ 

Theorem 4.19. The countable substitution dynamical system (£, a) is topologically weakly mixing. 

Proof. Let / G B(Tj) be such that / o a = Xf for some A 6 C. Assume that A ^ 1. Since there are no 
rational eigenvalues (see [27]), A — e 2 ' Kl ^ for some irrational number [3. Suppose that we can find u g S 
with the properties that: 

(i) lo has dense orbit, 

(ii) there exist Nk — » oo such that X k — ► — 1, 

(iii) a Nk uj — * uj. 

Then, by Lemma 4.18, since every point with dense orbit is a continuity point of /, the relation / o a Nk ui = 
X Nk f(oj) would lead to a contradiction. To find such a point uj we use Weyl's theorem on uniform distribution. 
Since (3 is irrational, for every k > 1 the set {C(n, k)(3 : n £ N} is uniformly distributed modulo 1. Therefore 
there exist n\ < ri2 < ■ • ■ < < . . . such that 



\C(n k ,k) _|_ ^ 



< — for all k. 



(4.7) 

Let x be the path in the Pascal graph defined by 

Since x is not eventually diagonal, uj = <fr(x) has a dense orbit in S, and hence condition (i) is satisfied. 
Let Nk = C(nk,k). Condition (ii) follows from (4.7), and condition (iii) is guaranteed by the Kink Lemma 
(Lemma 2.2). □ 

5. Questions 

Many properties of the Pascal adic and related systems remain to be determined. In particular, the 
question of weak mixing of the systems {X, T, /x p ) remains open. Since ergodicity of the Bernoulli measures 
fi p under the Pascal adic map T implies the Hewitt-Savage 0,1 Law, and weak mixing is stronger than 
ergodicity, results along these lines would have probabilistic implications. 

Recently the loose Bernoulli property of each (X, T, /i p ) has been established by dc la Rue and Janvresse 
[6]. We believe that these systems have infinite rank, and indeed that they do not have local rank one 
(see [10] for the definitions). Determination of the joinings, or even factors, of these systems would be of 
considerable interest, as would any more information about their spectra (simple? singular?). 

Dynamical properties of a class of adic transformations generalizing the Pascal adic, which code adic 
transformations on certain shifts of finite type (see [28]), are studied in a forthcoming paper by the first- 
named author [23]. 
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